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I  -  INTRODUCTION 
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In  the  paper  "Counting  Large  Numbers  of  Events  in  Small  registers",  Morris 
[Mo  1978}  proposed  a  probabilistic  algorithm  for  keeping  track  of  a  large  number 
of  events  M  with  an  n  bit  binary  counter  where  typically  M  >>  2°-!  «  largest 
integer  that  can  be  represented  by  the  counter.  This  method  of  counting  has  been 
dubbed  "Approximate  counting"  by  Flajolet  [FI  1985]  who  reformulated  the  problem 
in  terms  of  a  discrete  time  Markov  chain  b(t)  with  state  space 

I*  -  (0,  1 ,  2,  . . . ] 

and  transition  function  given  by 


(1.1) 


P(b(t+1 )  =  i+1  I  b(t)  -  i)  =  2  ,  i  >  0 


P(b(t+1 )  -  i  I  bit)  -  i)  -  1-i 


The  process  b(t)  arises  naturally  when  one  "counts"  an  event  with 
probability  2  and  does  not  record  it  with  probability  1-2  b  where 

b(t)  =  current  count  in  the  register.  How  well  does  b(t)  track  t  ? 


Morris  [Mo  1978]  ha3  noted  that 
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and  Flajolet  [FI  1985]  has  given  a  proof  of  (1.2). 


Thus  2b^-2  is  an  unbiased  estimator  of  t.  In  addition  Flajolet  has  shown, 
Theorem  1  of  [FI  1985],  that 

(1.3)  a, (t)  <  E(b(t) )  -  log2  t  <  a2(t) 

where  a  (t)  are  small  and  bounded  as  t  ■*  «. 
i 

The  proof  of  (1.3)  given  in  [FI  1985]  is  not  simple  since  it  uses  Mellin 
transforms  and  other  refined  techniques  from  the  theory  of  functions  of  a  complex 
variable.  It  is  the  purpose  of  this  paper  to  derive  (1.2)  and  (1.3)  by  means  of  a 
more  elementary  method,  at  least  more  elementary  to  probabilists  !,  using  only 
the  simplest  ideas  of  the  theory  of  martingales.  See  chapter  VI  of  [Ka-Ta  1975] 
f or  a  more  comprehensive  account.  More  precisely,  we  shall  prove  that  (1.3)  holds 
with 


-1 


a,(t)  *=  log2  ( 2t  ♦  log2) 


and 


-1 


a2(t)  =  log2  ( 1  ♦  2t  ) . 


In  addition  we  derive  a  new  recursive  formula  for  E(  2^^  b  ^ ) ,  l  2  1 


I nteresting^jj  enough  the  same  process  b(t),  but  with  b(l)  =  0,  occurs  in  a  I  1 
recent  paper  by  [GGMM  1985]  where  b(t)  represents  the  back  off  counter  occuring 
in  the  exponential  back  off  protocol  (EBO)  when  the  channel  is  always  jammed  ;  in 
fact,  this  is  what  first  stimulated  my  interest  in  this  problem  -  see  [Ro  198^]. 

The  problem  here  is  to  show  that 

y 

Ui  i  iJ  i.U  ■  / 


lim  sup  t  E(?”b(t))  < 


t  ♦  ro 

In  fact  the  following  result  was  proved  in  [GGMM  1985] 
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(1.4)  1/8  £  t  E(2_b(t))  £  9,  for  all  t  *  1. 

By  a  simple  martingale  argument  we  are  able  to  obtain  the  sharper  lower 

bound 

(1.5)  1  *  t  E(2_b(t)) 

and  with  a  little  more  effort  we  are  able  to  show 

(1.6)  t  E(2~b(t))  S  4.1 ,  t  >  1 . 

In  addition  [GGMM  1985]  conjectured  that 

(1.7)  E (2  b^)  ~  c  t  1  where  c  »  1/log2. 

In  fact  Flajolet  has  shown  [FI  1986]  that  (1.7)  must  be  modified  tt>  take 
into  account  bounded  fluctuations  w(t)  of  small  amplitude  i.e.  he  shows  that 

(1.8)  E(2  b(t))  »  c  t  1  ♦  w(t). 


The  proof  of  (1.8)  is,  as  is  to  be  expected,  quite  delicate.  Using  only 
very  simple  tools  we  are  able  to  prove  the  following  weak  form  of  conjecture 
(1.7) 

t-1  .  . 

I  a)  lim  sup  E  E(2  S  )/log  t  Sc 

t  +  «>  s  =  l 

t-1 

b)  lim  inf  E  E(2  3  )/log  t  5  1. 

t  *  <*  s=  1 


These  results  are  obtained  in  part  3- 

Acknowledgments  :  I  wish  to  thank  my  colleagues  F.  Baccelli,  G.  Fayolle,  P. 
Flajolet  and  P.  Robert  here  at  INRIA  for  several  helpful  discussions.  In 
particular  the  proof  of  the  lower  bound  (2.9)  is  due  to  P.  Robert. 


II  -  A  METHOD  FOR  CONSTRUCTING  MARTINGALES  ASSOCIATED  WITH  A  MARKOV  CHAIN 

The  following  discussion  assumes  the  reader  is  familiar  with  chapter  VI  of 
Karlin-Taylor.  Let  y(t)  be  a  Markov  chain  with  countable  state  space 
I+  -  {0,  1,  2,  ...}  and  transition  matrix  P  -  (P(i,j)).  Let  J^(t)  -  d$(y(u), 

0  £  u  £  t)  denote  the  smallest  o-field  generated  by  the  random  variables  y(u), 
0  £  u  £  t.  Intuitively  f^(t)  is  the  past  up  until  time  t.  We  recall  the 

Definition  :  we  say  that  x(t)  is  a  martingale  with  respect  to  (t) 
if  (i)  x(t)  is  ,J^(t)  measurable 

(2.1 )  (ii)  E(|  x ( t )| )  <  «  and 

(iii)  E(x(t+1  )  |  t$£(t))  ■  x ( t ) . 

In  order  to  construct  martingales  associated  with  the  Markov  chain  y(t)  we 
make  use  of  the  operators  P  and  A  acting  on  the  function  space 

00 

D  -  (f  |  f  :  l  -  R,  Z  P(i.j)  |  f(J)|  <  -} 

J-0 

Definition  :  Let  f  6  D 
00 

(i)  Pf ( i )  A  z  P(i,j)  f ( j ) 

“  j-0 

(2.2)  (ii)  A f(i)  A  Pf(i)  *  f(i) 

CD 

(iii)  P|  f|  (i)  A  Z  P(i,j)  |  f|  (j). 

"  J-o 

Note  that  E{f(y(t+1))  -  f(y(t))|  ,j£(t)}  =  E{f(y(t+1))  -  f  (y(t)  )|  y(t) )  -  Af  (y(  t) ) 
and  using  this  fact  it  is  easy  to  establish  the  following 

Lemma  :  Suppose  Pj  f j  ( i )  <  ™  and  set 
t-1 

x(t)  =  f ( y ( t ) )  -  Z  Af(y(s)),  t  >  1  and  x(0)  =  f(y(0)). 

3*0 

Then  (x(t),  (t))  is  a  martingale. 


(2.3) 


05 


For  future  reference  we  note  the  following  special  case.  Suppose  Af(i)  -  c 
all  1  i  0.  Then  f(y(t))  -  ct  is  a  martingale. 

Lemma  (2.3)  yields  a  novel  and  computationally  simple  proof  of  Flajolet' s 
proposition  0  as  well  as  a  recursive  formula  for  the  higher  moments  of  2b^b\ 


(2.U) 


Proposition  : 

Let  b(0)  -  1.  Then  E(2b(t))  -  t*2  and  o2(2b(t))  -  t(tM}/2. 


Proof.  We  begin  by  noting  that 


(2.5) 


(2.6) 


Af(j)  -  2  J  Cf(j-M)  -  f(j)3. 


Let  f^(j)  ”  2  J  where  £  -  0,  1,  2, 


Af^j)  -  (2  -1)  f  (J  ) . 


. .  An  easy  calculation  shows  that 


In  particular  Af,(j)  -  1  and  therefore,  by  lemma  (2.3),  we  have 


f , (b( t) )  -El-  2b(t)-t 


is  a  martingale.  Consequently 


E(f , (b(t) )  -  t)  -  f  i  (b(0) )  -  f,(1)  -  2  i.e. 


(2.7) 


E(f , (b( t ) )  -  E(2  ')  -  t+2. 


Lemma  (2.3)  and  formula  (2.6)  can  now  be  used  to  derive  a  recursive  formula 
for  the  moments  E(2  ^  ^),  1  5  2.  To  illustrate  these  ideas  consider  the  case 
1  =  2.  Since  A T 2 ( j )  =  3 T t ( J )  lemma  (2.3)  and  (2.7)  yield 


E  ( f  2  ( b  ( t ) )  -  E  3f,(b(s))  =  f 2 (b(0) )  -  k 
3-0 


.V  / 


t-1  t-1 

E(22b(t))  -  E(  f  2  (b(t ) )  -  <4  +  3  T  E(f ,  (b(s) )  -14  +  3  I  (s+2)  = 


4  +  3  •  ( l~-2—  *  6.t  -  |  t( t+3)  +  14. 


o2(2b(t))  -  E(22b(t))  -  E(2b(t))2  t(  t+3)  ♦  *1  -  ( t+2 ) 2  =  l-(-^  }. 

Using  (2.3)  we  can  give  a  recurrence  formula  for  the  higlher  moments  of 

2b^b^.  More  precisely  let  M^(t)  -  E(2^'b^t^),  l  »  0,  1 . Then  lemma  (2.3)  and 

(2.6)  yield 


(2.8) 


M  (t)  -  2l  +  (2!'-1 )  Z  M  (s) . 
1  s-0 


Note.  We  believe  formula  (2.8)  to  be  new. 

The  formula  E(2b^)  -  t+2  together  with  Jensen's  inequality  applied  to  the 
concave  function  <j> ( t )  -  log2t  yield 

E(b(t) )  £  log, (t+2)  -  log2t  +  log( 1 +2t  1). 

We  now  proceed  to  derive  a  lower  bound  for  E(2b^b^)  using  an  argument 
suggested  to  the  author  by  P.  Robert  (INRIA). 

An  easy  calcuulation  shows  that 

E(b( t+ 1 )  -  b(t) )  =  E(2~b(t))  £  2_E(b(t)) 


and  therefore 


2E(b(t))  _  E(b(t+1 )  -  b(t))  >  t 


Z  2  E  ^  b  ^ S  ^  E  (  b  ( s  +  1  )  -  b(s))  Z  t. 


Set  4.(3)  -  E(b(s))  ;  so  4>(0)  -  1.  The  preceeding  expression  can  be  rewritten  as 


E  2*(S)  ( 4> ( s+ 1  )  -  4>(s))  *  t. 
s-0 

On  the  other  hand 

t  S  E  2*(S)  [41(3+1 )  -  4>(s)]  S  f  }  2Sds  -  -  e10g2)/log2  i.e. 

s-0  1 

2<{’(t)  2  2  +  (log2).t 
and  this  implies 

E  ( b  ( t ) )  -  4>(t)  >  log2  (t[log2  +  2t  '])  -  log2t  +  log2  [log2  +  2t  1]. 


Summing  up  then  we'  re  shown  that 

;\9.  log2t  +  log2 ( 2t” 1  ♦  log2)  S  E(b(t) )  S  log2t  +  log2(1  +  2t  1 ) 

E(b(t) ) 

Clearly  this  Implies  lim  -t~  -  1  and  also  that 

£  -»oo  2 


(.7.10) 


log2 ( log2)  S  E(b(t) )  -  log2t  £  1/t. 


Ill  -  AN  IMPROVED  ESTIMATE  FOR  E(2  b(t)) 


In  this  part  of  the  paper  we  assume  b(1)  =  0  and  repeating  the  same 

argument  of  proposition  (2.4)  we  see  that 

t-1 

f , (b(t))  -  E  Af , (b(s) ) 
s-1 

is  a  martingale  with 

t-1 

E(f , (b(t) )  -  E  Af , ( b( s ) )  =  f , (b( 1 ) )  =  1 . 

3*  1 

Consequently  E(2b^t^)  ■  t.  We  now  apply  .Jensen's  inequality  with  J>(x)  =  x  1  to 


deduce  the  much  sharper  lower  bound 


(3.1) 


E(2  £  1/t. 


To  obtain  the  upper  bound 


(3.2) 


E( 2~b( t ) )  *  4. i/t. 


We  derive  an  exponential  bound  on  the  first  passage  time 


Ty  -  inf{t  :  b(t)  £y) .  This  leads  to  a  sharper  estimate  than  the  one  obtained  by 


[GGMM  1985]  who  used  Chebychev' s  inequality  ;  otherwise  the  two  proofs  are  the 


Lemma  :  For  any  X  £  X(y)  -  -  log(1  -  2  ) 


(3-3) 


p(t  2  t)  £  exp(-Xt+X)  II  (1  -  (1  -  exp(-X))2  ) 

y  i-1 


Proof.  Let  y.  be  the  random  variable  with  geometric  distribution  given  by 
-i  1  -i  j-1 

P(y  =j)  =2  (1-2  )  ,  j  £  1.  Thus  its  moment  generating  function 


J  i  i  -i 

E(e  )  =  (1  -  (1  -  exp(-X))2  ) 


Clearly  x t  ■=  1  and  for  y  >  1  we  have 


y-i 

i  +  z  y . . 


Since  the  y^,  1  S  i  £  y-1 ,  are  mutually  independant  we  see  at  once  that 


M  (X)  -  E(exp(X  t  ))  »  exp(X)  n  E(exp(X  y^)) 


exp(X)  II  (1  -  (1  -  exp(-X))2  ) 
i  =  l 


By  Chebychev' s  inequality  exp(Xt)  P(t^  -  t)  <  NMx)  which  is  precisely 
lemma  (3-3).  Note  that  a  sufficient  condition  for 


M  (X)  <  «  is  sup  21 (1  -  exp(-X))  <  1 
y  1 < i <y- 1 


In  particular  if  we  set  X  =  X(y)  where 
(3.J0  A(y)  -  -  log(1  -  2_y ) 

then  My(X)  <  °°  for  X  £  A(y)  and 

(3.5)  log(1  -  (1  -  exp(-X))2i)  2  -  ( log2) ( 1  -  exp(-X))21, 

where  we'  ve  used  the  inequality 

log(l-x)  >  -(log2)x  on  0  ^  x  02. 

Our  next  step  is  to  set  x  »  A(y)  in  lemma  (3-3)  and  then  take  the  log  of 
both  sides  which  leads  to 

y- 1 

log  P( t  *  t)  S  -A(y)(t-1)  +  ( log2)  Z  21 ~y 
y  i  =  1 

S  - X ( y ) t  ♦  ( log2) ( 1  -  2~y)  *  X (y ) . 

Consequently 

P(Ty  -  t)  £  c(y)  exp(-A(y)t) 

where 

c(y)  =  exp{X(y),  +  (log2)(1  -  2  y)}  5  23/u,  y  Z  1. 

We  have  thus  derived  the  exponential  bound 

(3-6)  P(xy  2  t)  S  23/2  exp( - X ( y ) t ) . 

We  now  proceed  to  estimate  E(2  Set  m  =  [log2t]  so 

(log2t)-1  <  m  <log2t,  consequently 

E(2  b(t)  ;  b(t)  >  m)  <  2/t. 

Here  E(f  ;  A)  means  integrating  f  over  the  subset  A.  So  the  problem  row  is 


to  3how  that 


and  to  estimate  the  constant  a.  Now 


>-b(t) 


m-1 


(3-7)  E(2  uvw  ;  b(t)  £  m-1)  £  £  2  J  P(b(t)  £  j) 

j-0 


But 


and 


P ( b ( t )  S  j)  -  P(Tj  2  t)  S  23/2  exp(-X(j)t) 
-  AC j )  -  +  log(1  -  2_J)  S  -2_J 


together  imply 


3/2  - 1 

P(b(t)  S  J)  S  2  .  exp(-  2  Jt) 


S  23/2  exp(-  2m  J). 


The  right  hand  side  of  (3-7)  ia  bounded  by  the  sum 

m- 1 
I 

j-0 

,m-j 


S,  =  l  2  j  exp(-  2m  J). 


Since  -  2  £  -  (m-j)2  it  follows  that 


where 


,/p  m- 1  _  .  .  _/p  m-1 

S,  S  2J  22  J[exp(-2)]m  J  =  25  ^  exp(-2m)  £ 
J-0  j-0 

S  (23/2  /  [ ( e2/2) -1 ])  2 S  c/t 


S/p 

a=  2  /  [ ( eJ/2) - 1 ]  =  2.0993859. 


Finally,  putting  these  estimates  together  we  obtain 


E(  2 


-b(t) 


(3.8) 


)  2  JJ. 0993859  /  t. 


We  conclude  this  part  of  the  paper  by  establishing  a  weak  form  of  the 
conjecture  (1.7).  More  precisely  set  m(t)  =■  E(2  b^)  an(j  apply  lemma  (2.3)  to 
the  function  f(j)  »  j.  Since  Af(j)  -  2  ^  we  see  at  once  that 

E(b(t)  -  Z  2_b(3))  =  0 
s=1 

and  therefore 


t-1 


■b(s) . 


(3.9)  Z  E(2  “v°')  =  E( b( t ) )  <  log2 ( t+2)  =  log( t+2)  /  log2. 

s»1 


,~b(  t ) 


On  the  other  hand  as  we've  already  seen  E(2  )  >  1/t  which  implies 

t-1 


(3.10) 


t-1  .  .  t-1  .  . 

z  e(2  {s))  >  z  1/3  *  1  ♦  ;  ^  >  1  ♦  log(t-i) 

3=1  S=1  1 


Combining  (3.9)  and  (3-10)  yields 


I  E(2_b(s)) 


(a)  lim  sup 
t  -* 


s«1 


log  t 


S  1/log2 


(3.1D 


Z  E(2_b(s)) 


(b)  lim  inf 
t  •+  <*> 


log  t 


>  1  . 


Note  that  if  in  fact  E(2  b(“')  c/s  then  Z  E(2  b(s) 


)  -  c  log  t.  This  is 


s=  1 


why  we  call  (3.11)  the  weak  form  of  conjecture  (1.7). 


IV  -  AN  ESTIMATE  FOR  THE  TAIL  OF  THE  DISTRIBUTION 


Using  the  result  that  E(2  b^t^) 
inequality  it  is  easy  to  see  that 


n 

O 


(3/2) 


t(t*3) 


and  Chebychev' s 


log 


A)  =  0(2  26t’ 


'),  A 


0. 


( •'<  •  1  ) 


P  (  b  ( t )  > 


In  fact 


P(b(t)  *  2  log j  t+6) 


P(22b(t)  2  226f)  S  s  C|  2-26.t-2 


,26 


t" 


where  ct  is  independent  of  6  and  t.  This  is  in  fact  a  considerable  strengthening 
of  Flajolet*  s  proposition  4  wherein  he  shows  that 


P(b(t)  -  21og2 1  ♦  5)  =  0(2  6t  •"), 


uniformly  in  t  and  6  £  0. 


«N  y, -*v  aV»\>  .*/-V*»>  .v 
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